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Abstract 

Mobile service systems offer useful information to users through mobile devices. Mobile service 
systems have the capability of effectively mining a special request from abundant data based on 
user movement behavior patterns. UMBPs which includes U(mobile user), L(location), 
T(timestamp) and S(service request). Effective mining can be performed by introducing standard 
graph-matching algorithms along with the primitives of a database management system, which 
comprises grouping, sorting, and joining operations. Moreover, by mining the associated 
structure via maximum weight bipartite graph matching, a prediction mechanism, based on the 
model of UMBPs, is used to find strong relationships among U, L, T and S. This paper aimed at 
mining mobile User Movement Behavior Patterns such that movement locations associated with 
suitable services could be predicted and recommended for users. 
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I. INTRODUCTION 

The advances in wireless communications and mobile device technologies not only accentuate 
various wireless communications applications but also enable the provision of plentiful kinds of 
mobile services for users[3],[6]. In mobile service environments, mobile users may request 
various kinds of services and applications through mobile devices. 

To achieve quick response from the system, the four characteristics are used in mobile services 
environment. This is called as User Movement Behavior Patterns (UMBP). The UMBP 
characteristics are mobile Users (U), movement Locations (L), Timestamps (T), service 
requests(S).Here data mining is to be used to an efficient and excellent performance result in 
mobile service responses. The project contains both spatial and temporal mining, also data 
mining approach is proposed for efficiently discovering UMBPs in a mobile service 
environment. Traditional mobile service systems are inadequate in handling complex UMBPs 
without taking U (mobile users), L (movement locations), T (dwell time in timestamps), and S 
(service requests) into serious consideration. 

Over the past few years, some studies have employed data mining techniques to discover 
interesting patterns from Web logs or large databases [1]. Although some recent studies have 
made progress on data mining in mobile service systems, they were mostly focused on issues 
such as moving path mining or service request log mining. Mainly faced the problems of location 
tracking and resource allocation. Moreover, by modeling UMBPs via maximum weight bipartite 
graph matching [9], this prediction mechanism provides appropriate recommendations for users 
in terms of the spatial movement locations of each mobile user with requested services in 
different time intervals. Nevertheless, for acquiring an appropriate recommendation and a precise 
prediction, the four relational patterns of U, L, T, and S must be considered simultaneously. 
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Fig: Example of the deployed locations and user movements 



Fig. shows that there are ten different locations are scattered randomly in a spatial area. In 
addition, five users are denoted with different trajectories and with different plotted line styles. 

A data mining approach is proposed for efficiently discovering user movement behavior 
patterns in a mobile service environment, namely matching mobile access pattern, which consists 
of user movement along with requested services in different time intervals. The mobile access 
patterns consists of three databases, User Movement database (UMD), Location Movement 
Database (LMD), Service Request Database (SRD). The UMD, which is a 2-D database, is 
considered the original raw database. The LMD is obtained after validation and conversion 
mapping from the UMD. The SRD is used to store the service request sequences of mobile users 
into a structure that is suitable for data mining. A family of match join weights yields a 
maximum match by MJSC. In addition, this associated tree -based technique is used to determine 
the relationships among data in the structure. An associated tree-based pattern, extracted from 
candidate cardinality matching tuples, is obtained based on the given minimum support value. 

The rest of the paper deals with the following sections: Section II discuss about a review of 
moving objects in the database. In section IV, both match join algorithms MJMF and MJSC are 
described. Modules described in Section V. Finally, conclusions are presented in Section VII. 



The various terminologies used are : 
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Time vertical — the total duration the data have been captured. 

Movement— the mobile user moves from one base station to another base station. 

Interesting location— the mobile user stays at a location longer than the predefined maximum 

duration. 

Generic location— the mobile user stays at a location shorter than the predefined maximum 
duration. 

T m is the ID number of the stamp, Ui is the ID number of the user, Lj is the ID number of the 
location.T s is the duration that mobile users dwell at location Ui. S n = {si, S2, S3,...,s n } is the set 
of services requested by mobile users. Each element in S n is given a designated optimum time of 
using this service. If a mobile user uses the service for a longer duration than the optimum time, 
then the service is regarded as an interesting service. Each element in S n is the ID number of the 
service; T r is denoted as the duration of using a service. 



Table: Example of raw data for MJSC 
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III. BASIC DEFINITIONS 
The match joins of matching mobile access patterns from the three databases, UMD, LMD, 
and SRD. The basic definitions are given as follows: 
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Definition 1 — [Mobile User]: U = {ul, u2, u3, . . . , ui} is the set of mobile users. Each mobile 
user represents a person who carries a mobile device that has the capability of receiving services 
from the mobile environment, can be identified and tracked. 

Definition 2 — [Location]: Two special locations used to identify the regularities of the visiting 
locations are the generic location and the interesting location. The generic location is a collective 
term for one or more interesting locations, and the interesting location is a subset of the generic 
location. The generic location can be defined as L = {11, 12, 13, ... , lj}, where each element Ij 
represents a generic location. The interesting location can be defined as the user ui staying at a 
location li longer than the maximum duration. 

Definition 3 — [Timestamp and Maximum Duration]: The timestamp Tm have an equal period 

and a uniform unit. The maximum duration is considered to be 30 min, in general. 

Definition 4 — [Service]: S = {si, s2, s3, . . . , sn} is the set of services requested by mobile users. 

Each element represents an individual service ID. An optimum time is set for each service 

requested. 

Definition 5 — [Weight Value w-Join]: The weight value w is used to identify the importance of 
an extracted pattern. The w is dependent on three factors, the weight values of match joins 



u t>a l, lmt and Tm5. 



weight value of U t>a l = [LCi/Maxf, 

LC 2 /Maxf, , LQ/Maxf] 

weight value of L >a X = [Tsi/d Ma x, 

TS2/djy[ax, , Ts m /dMax] 

weight value of rws = [Tri/Oti, 

Tr 2 /Ot 2 , , Tn/Oti] 

LCi is the count of arriving at location li for each user. 
Maxf is the maximum LCi for each user. 
Tsi is the duration a mobile user dwells at location li. 
dMax is the maximum Tsi. 

Tri is the duration of requested service at each location li. 
Oti is the given optimum time of using the z'th service. 
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IV. MATCHING MOBILE ACCESS PATTERNS 



Mining matching UMBPs can be done by operating on relational flows to generate the max flow 
of match joins by means of MJMF. The transformation from a matching problem to the max 
flow problem can be divided into three phases. They are grouping nodes, building the reduced 
graph, and exercising the max flow algorithm. The first step in the grouping node is to find 
tuples whose relations have different weight values on the join columns. Second, the weight 
values of match joins u >a i, l <^ t and t c-a s are to be calculated. With the weight values 
and the designated min_sup value, the relative intensity(RI) can be carried out to construct the 
reduced max flow graph. 

MJSC computes the match join of the four input relations by first dividing the relations into 
groups of candidate matching tuples of U, L, T, and S, and, second, by operating the match join 
within each group. The steps of the algorithm are as follows: 

Step 1: Perform an external sort of four input relations on all attributes involved with w. 

Step 2: Iterate through the relations and generate the next group G of the tuples of U, L, T, and S. 

Step 3: Within G, combine the four subsets of U, L, T, and S tuples. Compared to the operation 

of combine-join, the iterators operating in the table can be advanced as soon as matches are 

found. 

To determine the relative intensity the related parameters of three sorts, M(ui, lj), M(lj, tm), 
and M(tm, sn), which correspond to al, al, a3 values have to be given in advance. The three 
parameters follow the relation of al + al + cc3 = 1. Second, the weights of the join U LTS are 
normalized to identify a useful relation, which corresponds to the weight value. If the weight 
value is greater than or equal to the min_sup value, then the relation is denoted as useful. Then 
the RI is calculated with related parameters al, al, and a3. Finally, according to different users, 
grouped these tuples (ui, lj, tm, sn) into /-groups. The operation through the table within a group 
can then retrieve the maximum match. As a result, each group has its maximum match based on 
the RI. Based on the RI of match join weights w, the maximum flow can be obtained by MJSC. 

The RI can be determined as follows: 
RI = El(ui, lj) x al + E2(lj, tm) x al + E3(tm, sn) x a3. 

The min_sup value, which has been set to 0.2 in advance. Meanwhile, al, a2, and a3 are the 
related parameters of edges (ui, lj), (lj, tm), and (tm, sn), whose values are 0.2, 0.4, and 0.4, 
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respectively. El(ui, lj), E2(lj, tm), and E3(tm, sn) are the corresponding normalized weight 
values of (ui, lj), (lj, tm), and (tm, sn), respectively. 



V. MODULES DESCRIPTION 



A. Mobile Network Model 



Mobile Networks refer to a family of routing protocols developed to route traffic through 
mobile wireless networks. These networks place special requirements on routing protocols due 
the unpredictable nature of the radio links and changing network topology due to node mobility. 
As the mobile user movement locations are sampled over a long-term historical data, the 
movements of the mobile user are tracked as an n-length sequence S of spatial locations of the 
form {( li,ti) , ( l2,t2) ,...,( lj,tj) }, where lj is the j -location and tj is the time the mobile user 
arrives at the j -location. 
B. Mobile Transaction Sequence Dataset 

In a mobile transaction database, users in the different user groups may have different mobile 
transaction behaviors. The user give the information i.e. location, time, type of service, user Id to 
the mobile service station for getting the requested service. We assume that the user movement 
database (UMD) is the raw database for mobile access patterns mining. It consists of a 2- 
dimensional database in which each column represents the individual mobile user and each row 
represents individual time, and the duration between each time unit represents an equal duration, 
whether it is one minute, or one hour. 



C. Maximum Join Sort Combination(MJSC) 



Match joins using sort combination (MJSC) that computes the match join of the four relations 
by first dividing up the relations into groups of candidate matching tuple of U, L, T, and S and 
then computing the match join within each group. 

The main steps of the algorithm are as follows: 

1. Perform an external sort of four input relations on all attributes involved in w. 

2. Iterate through the relations and generate the next group G of U, L, T, and S tuple. 
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3. Within G, combine the four subsets of U, L, T, and S tuple, just as in combine join, except 
that iterators on the table can be advanced as soon as matches are found. The related intensity of 
a conventional sort combined join is proportional to the product of its input relations. The 
weightage values of the dataset is to be calculated such as wl,w2,w3. Then calculate the relative 
intensity. 

D. Service Response and Performance Analysis 

According to the weight values wl.w2,w3 and its relative intensity value the service is to be 
provided. The services are provided to the particular user based on high relative intensity values. 

When compared to the existing system, the proposed system has less execution time with 
more candidate data set. When the min_ sup value of the generated candidate data sets increased 
from 0.2% to 1%, the size of the candidate data sets decreased from 50 to 10. 



E. Block Diagram 
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VI. EXPERIMENTAL RESULTS 



Simulations were conducted as follows. The objective of this simulation was to compare the 
performance of the proposed algorithms with the Apriori-like algorithm and evaluate their 
sensitivity by varying the parameters of various data characteristics. The change in the strong 
relationships among these nodes (ui, lj, tm, sn) was examined. Then, the support values of match 
joins were compared with different candidates, including the min_sup value and the number of 
nodes obtained with the length of k-tree candidates. Finally, the performance of the three 
synthetic databases with parameters in varying degrees was tested. 

In the simulation, generated 5-10 mobile users and each mobile user had a distinct movement 
path. To simulate the movement patterns of the mobile users, the probability model was utilized 
to model user movement behaviors. Each mobile user had a unique movement behavior, it means 
that the user would frequently move in certain locations and request certain services. In given 
locations where a mobile user usually stayed and moved among the locations, movement paths, 
denoted as the shortest paths, for the user were generated. To generate the movement patterns of 
the mobile users, the approximate locations for the maximal frequent patterns needed to be 
determined . As the number of users and locations were inputted, random maximal patterns were 
selected, and several random trajectories were generated in space. Thus, the maximal frequent 
patterns were obtained according to the preset parameter Maxfi to determine the maximal 
frequency of each user arriving at locations each day. To generate the dwell time in timestamps 
T, the parameter Tn was used to determine the number of timestamp patterns. Dwell time 
patterns were generated randomly; each dwell time pattern was attached to timestamps T. Each 
mobile user had a location sequence S, which was a randomly consecutive location. The dwell 
time pattern was a subsequence of S. To generate service request patterns, each mobile user 
requested services in different locations, and the time of each service request pattern was 
generated randomly. Each service request in a location contributed to its dwell time of using each 
service pattern in timestamps T. Given the number of services, the parameter Maxfs was used to 
determine the maximal frequency of each user requesting services each day, and the parameter 
Oti was used to determine the optimum time of using each service. 

It was observed that the proposed approach obtained more candidate patterns than the Apriori- 
like algorithm. The proposed algorithms is much better than the Apriori-like algorithm. That is, 



A Monthly Double-Blind Peer Reviewed Refereed Open Access International e-Journal - Included in the International Serial Directories 
Indexed & Listed at: Ulrich's Periodicals Directory ©, U.S.A., ItMJiPBtfyf j as well as in Cabell's Directories of Publishing Opportunities, U.S.A. 



International Journal of Management, IT and Engineering 
http://www.ijmra.us 





Fig. Execution time in response to changes in different minimum support values. 




Minimum Support (%) 



Fig. Size of candidate datasets in response to changes in different minimum support values. 



VII. CONCLUSION 

The proposed method was based on User Movement Behavior Patterns(UMBPs).The UMBPs 
can be regarded as the match joins of U, L,T and S based on their weight values. The simple 
approach of computing the full w-joins and then applying standard graph-matching algorithms 
and the DBMS primitives of grouping, sorting, and joining could be utilized to yield efficient 
match join operations. Finally, through experimental evaluation under various simulation 
conditions, and using synthetically generated data, the proposed methods produced excellent 
performance results in terms of execution efficiency and scalability. This paper aimed at mining 
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mobile UMBPs such that movement locations associated with suitable services could be 
predicted and recommended for users. 
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